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METHOD AND APPARATUS FOR 
SHARING DATA STRUCTURES BETWEEN 
ASSEMBLY LANGUAGE PROGRAMS AND 
HIGH-LEVEL LANGUAGE PROGRAMS 

5 

FIELD OF THE INVENTION 

The present invention generally relates to sharing data between assembly 
language computer program code and high-level language code, and more particularly to 
10 a method for sharing data structures between assembly code and high-level code. 

BACKGROUND OF THE INVENTION 

Low-level system software, for example, a BIOS, has historically been written in 
assembly language. As systems have evolved and become more complex, additional 

15 functions have been developed using a high-level language, for example C. Since access 
to certain instructions and/or registers is generally unavailable in a high-level language, 
assembly language code is likely to remain in use. 

It is common in applications that include assembly code and high-level code to 
reference the same data structure in both the assembly code and in the high-level code. 

20 To support the references in the different languages, however, requires special measures 
since high-level language references are unsupported in assembly language code. 

One approach for dealing with references to data structures in applications having 
assembly and high-level code uses a utility to convert a file having a high-level language 
data structure into the target assembly language. This approach is limited by the 

25 capabilities of the target assembly language to define data structures. Thus, with some 
assemblers, only simple data structures may be shared between the assembly and high- 
level code. 

Another approach uses a utility to translate high-level language data structures 
into assembly "equate" statements. This approach requires the generation of equate 
30 statements for all addressable elements of a data structure. Thus, for complex data 
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structures, for example, nested structures and arrays of structures, generation of suitable 
equate statements is impractical. 

A method and apparatus that addresses the aforementioned problems, as well as 
other related problems, are therefore desirable. 

5 

SUMMARY OF THE INVENTION 

In various embodiments, the invention provides a method and apparatus for 
sharing one or more high-level language data structures between an assembly language 
program and a high-level language program. In one embodiment, data structures that 

10 include one or more elements are defined in both an assembly language source code file 
and in high-level language source code file, with the definitions in the assembly language 
source code file being in the high-level language. The storage requirements of the data 
structures are determined from the definitions set forth in the assembly source file and the 
definitions are removed from the assembly source file prior to processing by an 

15 assembler. Memory allocation directives that are included in the assembly source file 
specify memory addresses of the data structures and are removed from the assembly 
source program prior to processing by the assembler. The assembly source file includes 
references to elements of the data structures in the form of substitution directives. Using 
the memory allocation directives and the data structure definitions, the substitution 

20 directives are replaced with memory addresses or element sizes prior to processing by the 
assembler. 

The above summary of the present invention is not intended to describe each 
disclosed embodiment of the present invention. The figures and detailed description that 
follow provide additional example embodiments and aspects of the present invention. 

25 

BRIEF DESCRIPTION OF THE DRAWINGS 

Other aspects and advantages of the invention will become apparent upon review 
of the Detailed Description and upon reference to the drawings in which: 

FIG. 1 is a data flow diagram illustrating the transformation of assembly code 
30 having high-level definitions of data structures and references thereto; 
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FIG. 2 is a flowchart of a process for providing data structure access to both 
assembly language code and to code written in a high-level language; and 

FIG, 3 is a flowchart of the pre-processing performed on an assembly language 
source file in accordance with one embodiment of the present invention. 
5 While the invention is susceptible to various modifications and alternative forms, 

specific embodiments thereof have been shown by way of example in the drawings and 
will herein be described in detail It should be understood, however, that the detailed 
description is not intended to limit the invention to the particular forms disclosed. On the 
contrary, the intention is to cover all modifications, equivalents, and alternatives falling 
10 within the spirit and scope of the invention as defined by the appended claims. 

DETAILED DESCRIPTION 

In various embodiments, the invention supports references to data structures in 
both assembly language code and in high-level language code. The data structures to be 

15 referenced are defined in the high-level language in both the assembly code and in the 
high-level code. The assembly language code includes allocation directives and 
substitution directives for allocating memory and for resolving references to the data 
structures. The assembly code is processed prior to assembly ("pre-processed") to 
replace the substitution directives with addresses and/or sizes of the data structures based 

20 on the allocation directives and type definitions of the data structures. The pre- 
processing of the assembly code removes the data structure definitions and allocation 
directives from the source code. The resulting assembly code can then be assembled 
since it includes constructs recognized by the assembler. The high-level source code 
includes the same definitions of data structures and can be compiled normally. The 

25 object code from the assembler and compiler can then be linked into an executable 
segment of code. 

FIG, 1 is a data flow diagram illustrating the transformation of assembly code 
having high-level definitions of data structures and references thereto. Block 102 
represents a source code file of assembly code, and block 104 represents a high-level 
30 language source code file. Both assembly source file 102 and high-level source file 104 
include definitions of one or more shared data structures. For example, in the C 
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language, the reserved word "typedef is used to begin the definition of a data type, 

which may be a language supported data structure (e.g., structures, arrays, nested 

structures, arrays of structures, etc.) or a hierarchy built with language supported data 

structures. Two typedefs are illustrated in the assembly source file 102. It will be 

5 appreciated that "include" files may also be used as recognized by those skilled in the art. 

Example type definitions are set forth in the code below. 

typedef struct { 
long handle; 
unsigned short vars[8]; 
10 }s1Type; 

typedef struct { 
char id[4]; 
sUype s1ltem[10]; 
15 } s2Type; 

#define S2BASE 0x1800 

The #define statement is used to equate the label s2BASE with the hexadecimal value 

0x1800. This allows the label s2BASE to be used in the assembly code instead of the 

20 hexadecimal value. 

In addition to the data structure definitions, the assembly source file 102 also 
includes allocation directives and substitution directives. Allocation directives are used 
to define the memory locations of the data structures. In one embodiment, an allocation 
directive has the following syntax: 

25 #alloctypename var <array> @ address 

where typename is the name specified in the typedef statement, var is the variable name, 

which may optionally be an array, and address is the absolute hexadecimal address at 

which the data structure is to be allocated. Example allocation directives for the example 

typedefs set forth above are set forth in the code below. 

30 #alloc sUype v1[3] @ 0x1700 

#alloc s2Type v2[30] @ S2BASE 

A substitution directive sets forth a hierarchical reference to an element within a 
data structure and includes a request for either the element's size or address. In one 
35 embodiment, a substitution directive has the following syntax: 
#(subs-directive reference)* 
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where subs-directive is one of "addrof ' , "sizeof , or "offsetof \ The addrof directive 

requests the absolute address of the referenced element, the sizeof directive requests the 

size of the referenced element, and the offsetof directive requests the offset of the 

referenced element relative to the base address of the structure. The "reference" sets 

5 forth a hierarchical reference to the desired element similar to a high-level language 

reference to the element. Example substitution directives are set forth in the code below: 

mov eax, #(addrof v1 [2]. handle)* ; 

mov si, #(offsetof v1[2].handle)# ; 

mov cx, #(sizeof v1 [2]. handle)* ; 

10 mov cx, #(sizeof v2[10].s1ltem[0].vars[3])# ; 

mov ebx, #(addrof v2[10].s1ltem[2].vars[3])# ; 

mov edx, #(offsetof v2[l0].s1ltem[2].vars[3])# ; 

In pre-processing the assembly language source file 102, the substitution 

15 directives are replaced with the addresses and sizes as specified. For example, the 

assembly language source code file 106 would not have the type definitions, allocation 

directives, and substitution directives from the original assembly source file 102. The 

substitution directives would be replaced with the appropriate addresses and sizes. 

Example assembly code resulting from the replacement of the substitution directives in 

20 the example code above is set forth in the code below. 

mov eax, 1728h 
mov si, 28h 
mov cx, 4h 
mov cx, 2h 
25 mov ebx, 202Eh 

mov edx, 82Eh 

Once the assembly source file 102 has been pre-processed into assembly source 

file 106, source 106 is assembled into object code 108. The high-level language source 

30 file 104 is compiled into object code 110. The example code below illustrates code 

included in high-level source file 104 for referencing the data structures described above. 

s1 Type *sl Ptr = (s1 Type *)0x1 700; 
s2Type *s2Ptr = (s2Type *)s2BASE; 



35 



(s1Ptr+2)->handie = 0x1111; 



5 



RA-5356 



Express Mail EL027383710US 



(s2Ptr+10)->s1 ltem[2].vars[3] = 3; 

size = sizeof(s2Ptr->s1 ltem[0].vars[3]); 

offset = (int)(&((s2Ptr+10)->s1 ltem[2].vars[3])) - (int)s2Ptr; 

5 Object code 108 and 1 10 are then linked to form executable code 112. 

FIG. 2 is a flowchart of a process for providing data structure access to both 
assembly language code and to code written in a high-level language. At step 202, 
definitions of the data structure are specified in both the assembly language code and in 

10 the high-level language code. In one embodiment, the data structure definitions in the 
assembly language code conform to the syntax of the high-level language. At step 204, 
the assembly language code is programmed to include allocation directives for the data 
structures. The allocation directives specify the absolute addresses of the data structures. 
To reference elements of the data structures in the assembly code, substitution 

15 directives are specified in combination with the assembly code, as shown by step 206. It 
will be appreciated that the high-level source code can be conventionally programmed to 
reference the elements of the data structures. 

At step 208, the assembly language source code is pre-processed to remove the 
data structure definitions and allocation directives, and replace the substitution directives 

20 with the appropriate sizes and addresses. After pre-processing, the assembly language 
code is processed by a conventional assembler at step 210. At step 212, the high-level 
language code is compiled, and at step 214 the object code from the assembler and 
compiler are linked into an executable program. The executable program can then be run 
on a suitable computer. 

25 

FIG. 3 is a flowchart of the pre-processing performed on an assembly language 
source file in accordance with one embodiment of the present invention. At step 252, the 
high-level type definitions are parsed to determine the storage requirements of the 
elements of the data structures. For example, the size of a structure can be determined 
30 from the constituent elements, for example, character strings, integers, floating point 
numbers, arrays, etc. The definitions are also removed from the assembly source code. 
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At step 254, the data structure addresses and sizes are read from the allocation 
directives in the assembly source code. The addresses are the absolute addresses of the 
data structures and the sizes are array sizes, for example. The addresses and storage 
requirements determined at steps 252 and 254 are then used in processing the substitution 
5 directives. 

At step 256, the substitution directives are replaced with either a size or an 
address, depending on the specified directive (i.e. 5 addrof, offsetof, or sizeof). Using the 
data structure size information obtained from the definitions and the address information 
from the allocation directives, the appropriate addresses can be computed. Once the 
10 substitution directives have been replaced, pre-processing of the assembly source file is 
complete. 

Accordingly, the present invention provides, among other aspects, a method and 
apparatus for sharing data structures between assembly language source code and high- 
level language source code. Other aspects and embodiments of the present invention will 
15 be apparent to those skilled in the art from consideration of the specification and practice 
of the invention disclosed herein. It is intended that the specification and illustrated 
embodiments be considered as examples only, with a true scope and spirit of the 
invention being indicated by the following claims. 
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